Abstract. We report on the exploitation of the stellar content of the Hamburg/ESO
objective prism survey by quantitative selection methods.



1 Introduction 



The Hamburg/ESO survey (HES; Wisotzki et al. 1996; Reimers and Wisotzki
1997; Wisotzki et al. 2000) covers the total southern ($\delta < 2.5^\circ$)
extragalactic ($|b| > 30^\circ$) sky in the magnitude range
$13.0 \lesssim B_J \lesssim 17.5$. Its primary aim is to find bright quasars.
However, at its spectral resolution of typically 15 Å FWHM at H$\gamma$, it is
also possible to select large numbers of interesting stellar objects
efficiently. These include, e.g., metal-poor halo field stars, carbon stars,
cataclysmic variable stars (CVs), white dwarfs (WDs), subdwarf B stars (sdBs),
subdwarf O stars (sdOs), and field horizontal branch A- and B-type stars
(FHB/A). Example spectra of some of these stars are displayed in Fig. 2.

Christlieb (2000) has developed quantitative object selection methods,
such as automatic classification, for the systematic exploitation of the stellar
content of the HES. In this paper we describe the automatic classification
methods used, and report on the results obtained so far.



2 Automatic spectral classification 

Each of the ∼10 million HES spectra can be represented by a feature vector
$x$. A number of features are automatically detected in the extracted and
wavelength calibrated HES spectra (Christlieb et al. 2001). For stellar work, the
available features include equivalent widths of stellar absorption and emission
lines, line indices for C$_2$ and CN bands, principal components of continua,
broad band ($U-B$, $B-V$) and intermediate band (Strömgren $c_1$) colours.
The colours can be derived directly from HES spectra with accuracies of
$\sigma_{U-B} = 0.092$ mag, $\sigma_{B-V} = 0.095$ mag, $\sigma_{c_1} = 0.15$ mag.

The goal of automatic classification in the HES is to identify objects
of a certain class in the large data base. That is, we want to construct a
decision rule that assigns a spectrum with feature vector $x$ to
one of the $n_c$ classes $\Omega_j$, $j = 1, \ldots, n_c$, defined in the
specific classification context. This is called supervised classification, as
opposed to unsupervised classification, where the aim is to group objects into
classes not defined before the classification process.

Fig. 1. Definition of HES area (framed) and numbers of ESO/SERC fields in which
the exploitation of the stellar content of the HES is currently carried out

For supervised classification a learning sample is always needed. For our
purposes, we define a learning sample to be a set of $n_{ls}$ objects for which
the feature vectors are known,

$$\{x_i\} = \{x_1, \ldots, x_{n_{ls}}\},$$

and for which the true classes are known. The true classes can be defined
e.g. by grouping a set of objects according to their stellar parameters (e.g.
$T_{\rm eff}$, $\log g$, [Fe/H]), or by assigning classes to a set of spectra by
comparison with reference objects. With the help of a learning sample,
information on the class-conditional probability densities $p(x|\Omega_j)$ can
be gained. $p(x|\Omega_j)\,dx$ is the probability of observing a feature vector
in the range $x \ldots x + dx$, given the class $\Omega_j$. Experience has shown
that in most HES applications it is appropriate to model $p(x|\Omega_j)$ by
multivariate normal distributions.
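As an illustration of the multivariate normal model, the following is a minimal sketch, not the HES implementation. It assumes the learning sample of one class is available as a NumPy array with one row per object; the function names `fit_gaussian` and `log_density` and the synthetic two-feature sample are hypothetical.

```python
import numpy as np

def fit_gaussian(features):
    """Estimate mean vector and covariance matrix of one class.

    `features` is an (n_objects, n_features) array holding the
    learning-sample feature vectors of that class.
    """
    features = np.asarray(features, dtype=float)
    mean = features.mean(axis=0)
    cov = np.atleast_2d(np.cov(features, rowvar=False))
    return mean, cov

def log_density(x, mean, cov):
    """Log of the multivariate normal density p(x | class) at x."""
    diff = np.asarray(x, dtype=float) - mean
    _, logdet = np.linalg.slogdet(cov)
    mahalanobis_sq = diff @ np.linalg.solve(cov, diff)
    return -0.5 * (len(mean) * np.log(2.0 * np.pi) + logdet + mahalanobis_sq)

# Synthetic two-feature learning sample (illustrative values only).
rng = np.random.default_rng(0)
sample = rng.normal(loc=[0.4, 0.3], scale=[0.1, 0.15], size=(500, 2))
mean, cov = fit_gaussian(sample)
print(log_density([0.4, 0.3], mean, cov))
```

Working with the logarithm of the density avoids numerical underflow when the feature vector has many components.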

In many applications of automatic spectral classification in the HES, it
is not possible to generate a large enough learning sample from real spectra
present on HES plates, because the target objects are usually very
rare. Therefore, we have developed methods to generate artificial learning
samples by simulations, using either model spectra or slit spectra (Christlieb
et al. 2001).

Fig. 2. HES example spectra. Top panel: DQ white dwarf (left), cool carbon star
(right); second panel: DB white dwarf, PG 1159 star; third panel: cataclysmic
variable star, extremely metal-poor star, showing a very weak Ca K line; lower
panel: FHB/A star, cool DA white dwarf. Ordinates are photographic densities,
abscissae wavelengths in Ångström. Note that wavelength decreases from left to
right. The sharp cutoff at ∼5400 Å is due to the IIIa-J emulsion sensitivity
cutoff ("red edge").



2.1 Bayes' rule classification 

Classification with Bayes' rule minimizes the total number of
misclassifications, if the true distribution of class-conditional probabilities
$p(x|\Omega_i)$ is used (Hand 1981; Anderson 1984). Using Bayes' theorem,

$$P(\Omega_i|x) = \frac{P(\Omega_i)\,p(x|\Omega_i)}{\sum_{j=1}^{n_c} P(\Omega_j)\,p(x|\Omega_j)},$$

posterior probabilities $P(\Omega_i|x)$ can be calculated. A spectrum of unknown
class, with given feature vector $x$, can then be classified using Bayes' rule:

Bayes' rule: Assign a spectrum with feature vector $x$ to the class with the
highest posterior probability $P(\Omega_i|x)$.
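Bayes' theorem and Bayes' rule can be sketched in a few lines. This is a minimal illustration under stated assumptions: the priors and class-conditional densities are made-up numbers for three classes, and the function names `posteriors` and `bayes_classify` are hypothetical.

```python
import numpy as np

def posteriors(priors, densities):
    """Posterior P(class_i | x) from priors P(class_i) and densities p(x | class_i)."""
    joint = np.asarray(priors, dtype=float) * np.asarray(densities, dtype=float)
    return joint / joint.sum()  # the denominator sums over all classes

def bayes_classify(priors, densities):
    """Bayes' rule: index of the class with the highest posterior probability."""
    return int(np.argmax(posteriors(priors, densities)))

# Made-up example: three classes, densities evaluated at one feature vector x.
priors = [0.7, 0.2, 0.1]
densities = [0.5, 2.0, 1.0]
post = posteriors(priors, densities)
print(post, bayes_classify(priors, densities))
```

In this example the second class is chosen although the first has the largest prior, because the density at $x$ outweighs the difference in priors.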

The achievable accuracy of any automatic spectral classification algorithm
always depends on the signal-to-noise ratio (S/N) of the data used. In the
HES, the accuracies for spectra in the colour range $0.3 < B-V < 0.7$,
with S/N > 10 (typically corresponding to $B < 16.5$), are
$\sigma_{T_{\rm eff}} < 400$ K (or < 1.6 MK types), $\sigma_{\log g} < 0.68$ dex
(or < 0.55 luminosity classes) and $\sigma_{\rm [Fe/H]} < 0.68$ dex. The
classification accuracy in [Fe/H] strongly depends on [Fe/H] itself, and is much
better than 0.68 dex for [Fe/H] > -2.0. For cooler stars
($T_{\rm eff} < 5200$ K) not yet covered by our learning sample, the accuracy of
the luminosity classification is expected to be lower, since the sensitivity of
$c_1$ to gravity is higher in hotter stars.



2.2 Minimum cost rule classification 

In many of the classification problems arising in the HES, the aim is to
compile a sample of objects of a specific class, or a specific set of classes.
In these cases, Bayes' rule is not appropriate, because we do not want to
minimize the total number of misclassifications, but only the
misclassifications between the desired class(es) of objects and the remaining
classes. Suppose we have three classes, A-, F-, and G-type stars, and we want to
compile a complete sample of A-type stars. Then only misclassifications between
A-type stars and F- and G-type stars (and vice versa) are of interest. More
specifically, misclassifications of A-type stars as F- and G-type stars (leading
to incompleteness) are least desirable when a complete sample is to be compiled,
whereas erroneous classification of F- and G-type stars as A-type stars
(resulting in sample contamination) can be accepted at a moderate rate.
Misclassifications between F- and G-type stars can be ignored entirely, because
the target object type is not involved.

Classification aims like this can be realized by using a minimum cost rule.
Cost factors $r_{hk}$, with

$$0 \le r_{hk} \le 1; \quad h = 1, \ldots, n_c; \quad k = 1, \ldots, n_c, \qquad (1)$$

allow relative weights to be assigned to individual types of misclassifications.
The cost factor $r_{hk}$ is the relative weight of a misclassification from
class $\Omega_h$ to class $\Omega_k$.

Suppose we have an object of unknown class, with feature vector $x$. We
ask how large the cost is if it belongs to class $\Omega_h$ and is assigned to
class $\Omega_k$, $h \neq k$. The cost $C_{h \to k}(x)$ is:

$$C_{h \to k}(x) = r_{hk}\,P(\Omega_h|x). \qquad (2)$$

We do not know to which of the possible classes $\Omega_h$,
$h = 1, \ldots, n_c$, the object actually belongs. Therefore, we estimate the
expected cost $C_k(x)$ for assigning an object with feature vector $x$ to the
class $\Omega_k$ by computing the following sum of costs:

$$C_k(x) = \sum_{\substack{h=1 \\ h \neq k}}^{n_c} C_{h \to k}(x) = \sum_{\substack{h=1 \\ h \neq k}}^{n_c} r_{hk}\,P(\Omega_h|x). \qquad (3)$$

Now we can formulate the minimum cost rule, which minimizes the total cost
(Hand 1981).

Minimum Cost Rule: Assign an object with feature vector $x$ to the class
$\Omega_k$ with the lowest expected cost $C_k(x)$.
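The expected cost and the minimum cost rule can be sketched as follows for the A-, F-, G-type example. This is an illustration only: the posterior probabilities and the cost factors are hypothetical values, not those used in the HES, and the function names are assumptions.

```python
import numpy as np

def expected_costs(cost_factors, post):
    """Expected cost C_k(x) of assigning class k, for every k.

    cost_factors[h][k] is the weight r_hk of misclassifying an object of
    true class h as class k; post[h] is the posterior P(class_h | x).
    """
    r = np.array(cost_factors, dtype=float)  # copy, input stays untouched
    np.fill_diagonal(r, 0.0)                 # terms with h == k are excluded
    return r.T @ np.asarray(post, dtype=float)

def min_cost_classify(cost_factors, post):
    """Minimum cost rule: index of the class with the lowest expected cost."""
    return int(np.argmin(expected_costs(cost_factors, post)))

# Classes in the order A, F, G. Hypothetical weights for a complete A-star
# sample: losing an A star costs 1.0, contaminating the sample costs 0.2,
# and F/G confusion costs nothing.
cost_factors = [[0.0, 1.0, 1.0],
                [0.2, 0.0, 0.0],
                [0.2, 0.0, 0.0]]
post = [0.30, 0.45, 0.25]  # hypothetical posteriors for one spectrum

print(expected_costs(cost_factors, post))     # costs for A, F, G
print(min_cost_classify(cost_factors, post))  # 0, i.e. class A
```

With these weights the spectrum is assigned to class A although class F has the highest posterior. If all off-diagonal cost factors are set to 1, the expected cost of class k becomes one minus its posterior, and the rule selects the highest-posterior class.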

If the cost factors are chosen such that $r_{hk} = 1 - \delta_{hk}$, where
$\delta_{hk}$ is the Kronecker delta, the minimum cost rule classification is
identical to classification according to Bayes' rule. In this case the cost for
assigning the class $\Omega_k$ to a spectrum with feature vector $x$ is the
probability that the object belongs to one of the other classes $h \neq k$. This
follows immediately from (3). If $r_{hk} \neq 1 - \delta_{hk}$, the total number
of misclassifications is in general not minimized, so that the quality of a
minimum cost rule classification has to be evaluated by other criteria, such as
sample completeness for a given contamination.
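Completeness and contamination can be measured on a test sample with known classes. The sketch below assumes the usual definitions, which are not spelled out in the text: completeness is the fraction of true target objects that are recovered, and contamination is the fraction of the selected sample that does not belong to the target class. The labels are invented for illustration.

```python
def completeness_and_contamination(true_labels, assigned_labels, target):
    """Return (completeness, contamination) of the sample selected as `target`."""
    pairs = list(zip(true_labels, assigned_labels))
    n_true = sum(1 for t, _ in pairs if t == target)
    n_selected = sum(1 for _, a in pairs if a == target)
    n_correct = sum(1 for t, a in pairs if t == target and a == target)
    completeness = n_correct / n_true if n_true else float("nan")
    contamination = (n_selected - n_correct) / n_selected if n_selected else float("nan")
    return completeness, contamination

# Invented test sample of ten stars with known and assigned classes.
true_labels     = ["A", "A", "A", "A", "F", "F", "G", "G", "G", "F"]
assigned_labels = ["A", "A", "A", "F", "A", "F", "G", "G", "A", "F"]

print(completeness_and_contamination(true_labels, assigned_labels, "A"))
# (0.75, 0.4): three of four A stars recovered, two of five selected are not A
```

Varying the cost factors and recomputing both numbers gives the trade-off between completeness and contamination for a given classification problem.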

3 First results 

We briefly report on first results from the HES stellar work. More details
can be found in the paper series "The stellar content of the Hamburg/ESO
objective prism survey", which is currently being published in A&A, and in
Christlieb (2000).

3.1 Metal-poor stars

Spectroscopic follow-up observations of 58 candidate metal-poor halo stars
selected by automatic spectral classification in the HES showed that this
selection is more than three times as efficient as the selection in
the only other spectroscopic wide-angle survey for such stars, the so-called HK
survey (Beers et al. 1992; Beers 1999). The effective yield of turnoff stars
with [Fe/H] < -2.0 is 80% in the HES, but only 22% in the HK survey on average.
This is remarkable, given that the spectral resolution of
the HES (∼10 Å at Ca K) is a factor of two lower than in the HK survey (∼5 Å).
The advantages of the HES are: (a) broader wavelength coverage, (b) better
quality of the spectra, and (c) automated candidate selection procedures, as
opposed to visual inspection of objective prism plates in the HK survey.

In the spectroscopic follow-up campaigns of metal-poor stars carried out so
far, 90 metal-poor stars were discovered; 11 are unevolved stars with
[Fe/H] < -3.0. Since 37 stars with [Fe/H] < -3.0 and $0.3 < (B-V)_0 < 0.5$
were found in the HK survey, the sample of unevolved, extremely metal-poor
stars has already been increased noticeably. A first abundance analysis using
high-resolution spectra obtained with UVES, the high-resolution spectrograph
attached to VLT-UT2, was recently published (Depagne et al. 2000). We plan to
use the multi-fiber spectrograph 6dF at the UK Schmidt telescope to follow up
the thousands of metal-poor candidates that were selected in the HES, in
order to provide more of these interesting targets for high-resolution studies
using 10 m class telescopes.



3.2 Carbon stars 

On the 329 HES plates used so far for stellar work (effective area ∼6400 deg$^2$,
or 87% of the HES), 351 carbon stars were identified. The mean surface
density detected by the HES hence is 0.055 deg$^{-2}$, which is almost a factor
of three higher than the surface density found by Green et al. (1994) in their
photometric CCD survey. Moreover, the survey of Green et al. is ∼1.5 mag
deeper than the HES ($V_{\rm lim} \sim 16.5$ in the HES; $V_{\rm lim} \sim 18.0$
for the Green et al. survey). This indicates that photometric carbon star
surveys are highly incomplete.
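The quoted surface density follows directly from the numbers in the text, as this short check shows. The variable names are illustrative, and the implied total area is only approximate because the effective area is itself quoted as an approximate value.

```python
n_carbon_stars = 351          # carbon stars identified on the 329 plates
effective_area_deg2 = 6400.0  # approximate effective area used so far
area_fraction = 0.87          # fraction of the HES covered by that area

surface_density = n_carbon_stars / effective_area_deg2
implied_total_area = effective_area_deg2 / area_fraction

print(f"{surface_density:.3f} per deg^2")  # 0.055 per deg^2
print(f"{implied_total_area:.0f} deg^2")   # 7356 deg^2
```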

We have obtained recent-epoch CCD frames of most of the HES carbon
stars. A comparison with archival plate material available online is under way
to derive proper motions and to identify halo dwarf carbon stars in
our sample.



3.3 White dwarfs 

One exciting application for the hundreds of new white dwarfs that were
found in the HES is testing the double-degenerate (DD) scenario for type Ia
supernova (SN Ia) progenitors, in which a binary system consisting of two
white dwarfs of large enough mass merges and produces a thermonuclear
explosion. Although a couple of DDs were identified by radial velocity (RV)
variations, efforts so far have failed to identify any SN Ia progenitor
systems among the DDs, which is attributed to the small sample sizes
(Maxted and Marsh 1999). In a Large Programme approved by ESO (P.I.:
Napiwotzki), we use VLT-UT2/UVES to observe WDs at two randomly chosen
epochs, to find more DDs. We aim to observe a total of ∼1500 DAs
and DBs selected in the HES data base and taken from the literature. With
a set of 224 spectra of 107 WDs processed so far, 15 objects with RV
variations were found (Napiwotzki 2000, priv. comm.). Follow-up observations to
determine the orbital periods of these systems will be carried out in the near
future.